Scientific Article Summarization Using Citation-Context and Article's Discourse Structure
نویسندگان
چکیده
We propose a summarization approach for scientific articles which takes advantage of citation-context and the document discourse model. While citations have been previously used in generating scientific summaries, they lack the related context from the referenced article and therefore do not accurately reflect the article’s content. Our method overcomes the problem of inconsistency between the citation summary and the article’s content by providing context for each citation. We also leverage the inherent scientific article’s discourse for producing better summaries. We show that our proposed method effectively improves over existing summarization approaches (greater than 30% improvement over the best performing baseline) in terms of ROUGE scores on TAC2014 scientific summarization dataset. While the dataset we use for evaluation is in the biomedical domain, most of our approaches are general and therefore adaptable to other domains.
منابع مشابه
Towards Citation-Based Summarization of Biomedical Literature
Citation-based summarization is a form of technical summarization that uses citations to an article to form its summary. In biomedical literature, citations by themselves are not reliable to be used for summary as they fail to consider the context of the findings in the referenced article. One way to remedy such problem is to link citations to the related text spans in the reference article. Th...
متن کاملTowards Multi-Document Summarization of Scientific Articles:Making Interesting Comparisons with SciSumm
We present a novel unsupervised approach to the problem of multi-document summarization of scientific articles, in which the document collection is a list of papers cited together within the same source article, otherwise known as a co-citation. At the heart of the approach is a topic based clustering of fragments extracted from each co-cited article and relevance ranking using a query generate...
متن کاملSciSumm: A Multi-Document Summarization System for Scientific Articles
In this demo, we present SciSumm, an interactive multi-document summarization system for scientific articles. The document collection to be summarized is a list of papers cited together within the same source article, otherwise known as a co-citation. At the heart of the approach is a topic based clustering of fragments extracted from each article based on queries generated from the context sur...
متن کاملBook Review: The Structure of Scientific Articles: Applications to Citation Indexing and Summarization by Simone Teufel
Discourse models have received significant attention in the computational linguistics community with some important connections to the non-computational discourse community. More recently, the importance of discourse annotation has increased as models generated with supervised machine learning techniques are being used to annotate text automatically. A primary area for annotation is science. Th...
متن کاملIdentifying Non-Explicit Citing Sentences for Citation-Based Summarization
Identifying background (context) information in scientific articles can help scholars understand major contributions in their research area more easily. In this paper, we propose a general framework based on probabilistic inference to extract such context information from scientific papers. We model the sentences in an article and their lexical similarities as a Markov Random Field tuned to det...
متن کامل